Solving Atomic Multicast When Groups Crash

نویسندگان

  • Nicolas Schiper
  • Fernando Pedone
چکیده

In this paper, we study the atomic multicast problem, a fundamental abstraction for building fault-tolerant systems. In the atomic multicast problem, the system is divided into non-empty and disjointgroups of processes. Multicast messages may be addressed to any subset of groups, each message pos-sibly being multicast to a different subset. Several papers previously studied this problem either in localarea networks [3, 9, 20] or wide area networks [13, 21]. However, none of them considered atomicmulticast when groups may crash. We present two atomic multicast algorithms that tolerate the crash ofgroups. The first algorithm tolerates an arbitrary number of failures, is genuine (i.e., to deliver a messagem, only addressees of m are involved in the protocol), and uses the perfect failures detector P . We showthat among realistic failure detectors, i.e., those that do not predict the future, P is necessary to solve gen-uine atomic multicast if we do not bound the number of processes that may fail. Thus, P is the weakestrealistic failure detector for solving genuine atomic multicast when an arbitrary number of processes maycrash. Our second algorithm is non-genuine and less resilient to process failures than the first algorithmbut has several advantages: (i) it requires perfect failure detection within groups only, and not across thesystem, (ii) as we show in the paper it can be modified to rely on unreliable failure detection at the cost ofa weaker liveness guarantee, and (iii) it is fast, messages addressed to multiple groups may be deliveredwithin two inter-group message delays only.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fault-Tolerant Total Order Multicast to Asynchronous Groups

While Total Order Broadcast (or Atomic Broadcast) primitives have received a lot of attention, this paper concentrates on Total Order Multicast to Multiple Groups in the context of asynchronous distributed systems in which processes may suffer crash failures. “Multicast to Multiple Groups” means that each message is sent to a subset of the process groups composing the system, distinct messages ...

متن کامل

Genuine atomic multicast in asynchronous distributed systems

This paper addresses the problem of atomic multicasting messages in asynchronous distributed systems. Firstly, we give a characterization of the notion of genuine atomic multicast. This characterization leads to a better understanding of the di3erence between atomic multicast and atomic broadcast, and to make a clear distinction between genuine atomic multicast algorithms and non-genuine atomic...

متن کامل

Consensus-Based Fault-Tolerant Total Order Multicast

ÐWhile Total Order Broadcast (or Atomic Broadcast ) primitives have received a lot of attention, this paper concentrates on Total Order Multicast to Multiple Groups in the context of asynchronous distributed systems in which processes may suffer crash failures. aMulticast to Multiple Groupso means that each message is sent to a subset of the process groups composing the system, distinct message...

متن کامل

Genuine Atomic Multicast

This paper addresses the problem of atomic multicasting messages in asynchronous systems with unreliable failure detection. First, we give a characterisation of the notion of algorithms tolerating unreliable failure detection. Second, we give a characterisation of the notion of genuine atomic multicast. This characterisation leads to a better understanding of the diierence between atomic multic...

متن کامل

Genuine versus Non-Genuine Atomic Multicast Protocols

In this paper, we study atomic multicast, a fundamental abstraction for building fault-tolerant systems. We suppose a system composed of data centers, or groups, that host many processes connected through high-end local links; a few groups exist, interconnected through high-latency communication links. In this context, a recent paper has shown that no multicast protocol can deliver messages add...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008